Analysis of airplane boarding via space-time geometry and random matrix theory 
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We show that airplane boarding can be asymptotically modeled by 2-dimensional Lorentzian 
geometry. Boarding time is given by the maximal proper time among curves in the model. Discrep- 
ancies between the model and simulation results are closely related to random matrix theory. We 
then show how such models can be used to explain why some commonly practiced airline boarding 
policies are ineffective and even detrimental. 



Airplane boarding is a process experienced daily by 
millions of passengers worldwide. Airlines have devel- 
oped various strategies in the hope of shortening board- 
ing time, typically leading to announcements of the form 
"passengers from rows 40 and above are now welcome 
to board the plane" , often heard around airport termi- 
nals. We will show how the airplane boarding process 
can be asymptotically modeled by spacetime geometry. 
The discrepancies between the asymptotic analysis and 
finite population results will be shown to be closely re- 
lated to random matrix theory (RMT). Previously, air- 
plane boarding has only been analyzed via discrete event 
simulations ^0,0- 

We model the boarding process as follows: Passen- 
gers 1, N are represented by coordinates Xj = (qi,Ti), 
where % is the index of the passenger along the board- 
ing queue (1st, 2nd, 3rd and so on), and r is his/her 
assigned row number. We rescale (q,r) to [0,1] x [0,1]. 
It is assumed that the main cause of delay in airplane 
boarding is the time it takes passengers to organize their 
luggage and seat themselves once they have arrived at 
their assigned row. The input parameters for our model 
are: 

u - Average amount of aisle length occupied by a pas- 
senger. 

w - Distance between successive rows. 

b - Number of passengers per row. 

D - Amount of time (delay) it takes a passenger to 
clear the aisle, once he has arrived at his designated row. 

p(q, r) - The joint distribution of a passenger's row 
and queue joining time. p(q, r) is directly affected by the 
airline policy and the way passengers react to the policy. 

For the purposes of presentation, we shall assume that 
u, w, b, D are all fixed. The airplane boarding process 
produces a natural partial order relation of blocking 
among passengers. Passenger X blocks passenger Y if it is 
impossible for passenger Y to reach his assigned row be- 
fore passenger X (and others blocked by X) has sat down 
and cleared the aisle. Airplane boarding functions as a 
peeling process for the partial order defined by the block- 
ing relation. At first, passengers who are not blocked by 
others sit down; these passengers are the minimal ele- 
ments under the blocking relation. In the second round, 



passengers who are not blocked by passengers other than 
those of the first round are seated, and so forth. Board- 
ing time thus coincides with the size of the longest chain 
in the partial order. 

We assign to the boarding process with parameters 
u,b,w, D,p(q,r) a Lorentz metric defined on the (q,r) 
unit square by 



ds 2 



AD 2 p(q, r)(dqdr + ka(q, r)dq 2 ), 



(1) 



where k = bu/w and a(q,r) — p(q, z)dz. There are 
two properties of the metric which relate it to the board- 
ing process: 

• The volume form of the metric is proportional to 
the passenger density distribution p(q, r). 

• The blocking partial order among passengers dur- 
ing the boarding process asymptotically coincides 
with the past-future causal relation induced by the 
metric on the passengers, viewed as events in space- 
time via their q, r coordinate representation. 

To establish the second property, consider passengers 
represented by X — (q,r) and X' = (q + dq,r + dr), 
dq > 0. Consider the time passenger X arrives at his des- 
ignated row. All passengers with row numbers beyond r, 
who are behind X in the queue but in front of X' , will 
occupy aisle space behind X. The number of such passen- 
gers is roughly Nadq. Each such passenger occupies u/w 
units of aisle length, where we take the basic aisle length 
unit to be the distance between rows. The row differ- 
ence between X and X 1 is —(N/b)dr. We conclude that 
passenger X is blocking passenger X\ via the passen- 
gers which are behind him, roughly when dq > —akdr, 
a condition which coincides (together with dq > 0) with 
the causal relation induced by the metric. By the two 
main properties we may approximate asymptotically the 
airplane boarding process by the peeling process applied 
to the past-future causal relation on points in the as- 
sociated spacetime, sampled with respect to the volume 
form. By a well-known result, 2-dimensional Lorentzian 
metrics are conformally fiat, and hence, after an appro- 
priate coordinate transformation, we may assume that 
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the spacetime is given by a metric of the form 

ds 2 = r(x, y)dxdy (2) 

on some domain (not necessarily the unit square). In 
the new coordinates x, y, which are lightlike coordinates, 
chains in the causal relation coincide with increasing (up- 
right) subsequences, namely, sequences of points (xi,yi) 
such that X{ < Xj and y, L < yj for i < j. The peeling 
process applied to the causal relation coincides in this 
case with patience sorting which is a well-known card 
game process, which computes the longest increasing sub- 
sequence in a permutation jj, |5| . 

Denote by T{X) the maximal proper time (integral 
over ds) of a timelike trajectory ending at X = {q, r) and 
by L(t) the length (integral over \/~ds 2 ) of the spacelike 
curve defined by the equation T(X) = r. 
Using the analysis of the size of maximal increasing sub- 
sequences given in 0, Q , the two basic properties lead to 
the following modeling statements. 

(A) The boarding time of passenger X$ is approxi- 
mately y/NT(Xi). In particular, the total boarding time 
is approximately y/~Nd, where d = MaxxT(X) is the 
maximal proper time of a curve in the unit square with 
respect to the Lorentzian metric. 

(B) Let N(t) be the number of passengers with board- 
ing time at most \^Nt and N(t) = N(t)/N. Then, 
dN(r) = ^L(r)dT. 

Here the word "approximately" is used to mean that 
the ratio between the two quantities tends to 1 with prob- 
ability 1 as the number of passengers N tends to infinity. 

We apply statement (A) to the analysis of boarding- 
times. Consider first the case where the airline does not 
have a boarding policy, namely, passengers queue at uni- 
formly random times, so that p(q,r) = 1, a = 1 — r and 
the corresponding metric is 

ds 2 =Adq(dr + k(l-r)dq). (3) 

We use this model to study the effect of airplane design 
parameters such as the distance between rows and the 
number of passengers per row on boarding time. These 
parameters affect boarding time through the parameter 
k = — . To find the maximal proper time curve, we solve 
the Euler-Lagrange equation for proper time subject to 
the constraints of lying in the unit square. In figure 1 the 
maximal proper time curve is plotted for several values 
of the parameter k. For k < In 2 the curve is contained 
in the interior of the square and is therefore a geodesic. 
The length of the curve is 

d(k)=2^^. (4) 

For k > In 2 the maximal curve "crawls" at first along 
the g-axis until reaching a point B — (q(k), 0) such that 
the geodesic between B and A = (1,1) has a vanishing 
derivative at B. We have 

d{k) = 2Vl+2(l -ln2)/\/fc. (5) 
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k=0 . 5 



sim. for 10"6 pas. with k=0 . 5 

sim. for 200 pas. with k=0 . 5 

sim. for 100 pas. with k=0.5 

FIG. 1: Comparison of the maximal curve for k — 0.5 with 
simulation based maximal chains. 
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k=5 

sim. for 10"6 pas. with k=5 

sim. for 200 pas. with k=5 

sim. for 100 pas. with k=5 

FIG. 2: Comparison of the maximal curve for k — 5.0 with 
simulation based maximal chains. 



We compared the above computations with simulations 
of the boarding process. In Figures 1 and 2 we show, 
for k — 0.5 and k = 5, the computed maximal curve 
along with maximal chains obtained from simulations 
with N = 100, N = 200 and N — 10 6 . As can be 
seen from the figures, when N = 10 6 , the longest chain 
clusters along the computed curves. The length of the 
maximal chain also matches well the estimated values de- 
rived from the spacetime model. For k = 0.5 the length 
of the maximal chain in the simulation was 2261, while 
the estimated value is 2278, while for k = 5 the corre- 
sponding values were 4589 and 4740, respectively. For the 
more realistic values of N = 100, 200 passengers, there 
are substantial deviations of the maximal chain from the 
expected curve. Tabic II presents the average boarding 
time results for 1000 simulations of the boarding pro- 
cess for several settings of k and N. We also provide 
the boarding time estimate computed via the spacetime 
model. Upon inspection, Table II shows that, for realis- 
tic values of N, there are substantial differences in the 
range of 20-60 percent between the asymptotic board- 
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0.5 


100 


18.1 


22.8 


0.5 


200 


26.8 


32.2 


2.0 


100 


23.0 


32.5 


2.0 


200 


34.7 


45.8 


5.0 


100 


29.0 


47.5 


5.0 


200 


44.9 


66.9 



delta' 

TABLE I: A comparison of space-time model estimates with 
the average over 1000 simulations of boarding time results 



ing time estimates computed via Lorentzian geometry 
and the boarding time computed via simulations of the 
boarding process. These large differences correspond to 
the large deviations of the maximal chains from the ex- 
pected curves as seen in Figures 1 and 2. 

We also note that in all cases the Lorentzian estimate 
is larger. 

Let Ljv be the random variable representing the board- 
ing time according to the boarding process model (the 
simulation results). We define the discrepancy random 
variable Ajv = Ln — y/~Nd(k), which measures the differ- 
ence between the boarding time and the Lorentzian esti- 
mate. The curvature of the metric given in vanishes, 
and therefore we can apply a coordinate transformation 
W which changes the metric to the form 

dr 2 = Adxdy. (6) 

As noted previously, in this case past-future causal chains 
correspond to increasing sequences of points. We also 
note that, since the spacetime points are sampled accord- 
ing to the volume form, they are uniformly distributed. 
The discrepancy Ajv has been studied in the context of 
increasing subsequences of uniformly distributed points 
in a rectangle with sides parallel to the coordinate axis 
p|, and for a right angle triangle with sides parallel to 
the coordinate axis 0. In both cases, the discrepancy 
has order of magnitude N 1 ^ 6 . In the case of the rect- 
angle the normalized discrepancy is given asymp- 
totically by the Tracy- Widom distribution F 2 |l(j , which 
measures the normalized discrepancy of the largest eigen- 
value of an N x TV matrix in the Gaussian unitary en- 
semble (GUE) [13, in comparison with 2%/iV. For N/2 
uniformly distributed points in a right angle triangle with 
sides parallel to the axis, the normalized discrepancy is 
given asymptotically by the Tracy- Widom Distribution 
Fi , which is the normalized discrepancy of the largest 
eigenvalue in the Gaussian symplectic ensemble (GSE). 
The averages for these distributions are E{F2) = —1.77 
and E{F 4 ) = -2.3. 

Let A = W{0,0) and B = W(l,l). Let U be the 
image under W of the unit square. Let R be the rect- 
angle with sides parallel to the x and y axis and corners 
A and B, and denote by T the above diagonal triangle 
in R. U is contained in R, and when k < ln 2 it con- 
tains the above diagonal triangle T. The ratio of volumes 
Vol(R)/Vol(U) equals 6 ^T 1 for all fc, and thus, applying 
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FIG. 3: Average discrepancy for k = 3 as a function of N in 
Log-Log scaling. 

the estimates from || and we see that for k < ln2, 
as N becomes large we have 

F 4 (z) < P(A N < (L-I± N )i/* z )Q < F 2 (z) (7) 
In particular, we obtain the finer estimate 

- 2 -\— N ) - E(A N ) < -1.77 \ ~~j^~ N J 

(8) 

Looking at the results from Table II with k = 0.5, we 
see that the refined estimate (JHJ holds already for the 
realistic values N = 100,200; indeed 17.7 < 18.1 < 18.8 
for N = 100 and 26.4 < 26.8 < 27.8 for N = 200. When 
k > ln 2 the maximal curve contains a portion of the 
bottom edge of the unit square. Using the methods of 
[l2|. it can be shown that in such cases |Ajv| >> N 1 / 6 . 
However, we do not know how to compute analytically 
the order of magnitude of the error. Figure 3 shows the 
behavior of Ajv for k — 3 in Log-Log coordinates. As 
can be seen, the graph is essentially linear, and linear 
regression suggests the formula 

E{A N ) - -4.85N - 222 . (9) 

The spacetime metrics given by Q can be used for 
comparing different boarding policies. Boarding policies 
such as "passengers from row 40 and above board first, 
followed by the rest of the passengers" effect the passen- 
ger distribution function p(q, r) . We compared the results 
of the spacetime computations (without finer asymptotic 
corrections) with the results of detailed event driven sim- 
ulations of boarding processes 0, , which compare dif- 
ferent boarding policies. We computed spacetime based 
estimated boarding times for 25 different boarding poli- 
cies with parameter k = 4. Results based on detailed 
trace driven simulations for the same policies are re- 
ported in 0- A comparison of the results shows that 
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the spacetime estimates are in almost complete agree- 
ment with the trace driven simulation results regarding 
the ranking of the different policies. The correlation fac- 
tor between the 25-dimensional boarding time vectors is 
0.97. This is somewhat surprising given that the trace 
driven simulations in Q take into account many details 
of actual boarding processes which are not considered by 
our boarding process simulation. These include, among 
others, walking speed of passengers, passengers sitting 
at the wrong row and full overhead bins. The large dis- 
crepancies noted previously between the boarding simu- 
lation and the spacetime estimates are less of a factor, 
since when comparing boarding strategies only ratios of 
boarding times matter and these are less affected by the 
discrepancies. The main findings regarding actual board- 
ing policies are: 

- The commonly practiced back-to-front boarding poli- 
cies, which attempt to board passengers from the back 
of the airplane first, are ineffective for realistic values of 
3 < k < 5. The intuition behind this statement can 
be seen by the following simple reasoning. Assume the 
airline is perfectly successful in enforcing back-to-front 
boarding and thus passengers from the last row to board 
first, followed by the passengers from row m — 1 and so 
on. If k < 1, then passengers from all rows can sit con- 
currently without interference since they do not block 
each other. If k > 1 then the passengers in row m — 1 
have to wait until at least some of the passengers from 
row m have sat down, and similarly passengers from row 
to — j — 1 have to wait for passengers from row m — j to 
sit. This leads to a linear sized chain of blocking, which is 
much larger than the y/N -sized chains in random board- 
ing. 

- Among row dependent policies which do not severely 
constrain passengers, random boarding (no policy) is al- 



most optimal. 

- One can improve any row dependent boarding policy 
(including random) by first allowing window seat passen- 
gers to board, followed by middle seat and finally aisle 
passengers. Such policies lower the delay parameter D, 
which affects the metric via scaling. 

There are other discrete random processes which can 
be modeled via 2-dimensional Lorentzian geometry in a 
similar manner. One example of interest is the Polynu- 
clear growth (PNG) model, which is a particularly simple 
(1 + l)-dimensional growth process 0, The con- 

struction of the present paper can be considered as gen- 
eralizing the mapping of Prahofer and Spohn |l3j | of the 
basic PNG model to permutations. In this mapping the 
height of the PNG droplet corresponds to the length of 
the longest increasing subsequence in a random permuta- 
tion. In terms of the airplane boarding process this cor- 
responds to setting k = 0. More generally the methods of 
this paper provide a description of the macroscopic shape 
of a PNG droplet in an environment with non-uniform 
nucleation rates and lateral speeds. 

A different example is the process of scheduling I/O 
requests to a disk drive [Tj, • 

In conclusion, we have shown that the airplane board- 
ing process can be asymptotically modeled by a boarding 
parameter dependent two dimensional spacetime. The 
model can be used to analyze the dependence of boarding 
times on the various boarding parameters and boarding 
policies. The discrepancy between the asymptotic model 
and finite population results is closely related to RMT, 
at least for thin passengers. The analysis carries appli- 
cations to the design of good airplane boarding policies. 

We are grateful to Perci Deift, Ofer Zeitouni and Jinho 
Baik for very useful discussions. 
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